Abstract
CRISPR gene editing holds great promise to modify somatic genomes to ameliorate disease. In silico prediction of homologous sites coupled with biochemical evaluation of possible genomic off-targets may predict genotoxicity risk of individual gene editing reagents. However, standard computational and biochemical methods focus on reference genomes and do not consider the impact of genetic diversity on off-target potential. Here we developed a web application called CRISPRme that explicitly and efficiently integrates human genetic variant datasets with orthogonal genomic annotations to predict and prioritize off-target sites at scale. The method considers both single-nucleotide variants (SNVs) and indels, accounts for bona fide haplotypes, accepts spacer:protospacer mismatches and bulges, and is suitable for personal genome analyses. We tested the tool with a guide RNA (gRNA) targeting the BCL11A erythroid enhancer that has shown therapeutic promise in clinical trials for sickle cell disease (SCD) and β-thalassemia (Frangoul et al. NEJM 2021). We find that the top predicted off-target site is produced by a non-reference allele common in African-ancestry populations (rs114518452, minor allele frequency (MAF) = 4.5%) that introduces a protospacer adjacent motif (PAM) for SpCas9. We validate that SpCas9 generates indels (~9.6% frequency) and chr2 pericentric inversions in a strictly allele-specific manner in edited CD34+ hematopoietic stem/progenitor cells (HSPCs), although a high-fidelity Cas9 variant mitigates this off-target. This report illustrates how population and private genetic variants should be considered as modifiers of genome editing outcomes. We expect that variant-aware off-target assessment will be required for therapeutic genome editing efforts going forward, including both ongoing and future clinical trials, and we provide a powerful approach for comprehensive off-target prediction. CRISPRme is available at crisprme.di.univr.it.
No relevant conflicts of interest to declare.